Using Support Vector Machines for analysis of gene expression data from DNA microarrays

نویسنده

  • K. Fujarewicz
چکیده

DNA microarrays (biochips) are a new tool that biologists can use to obtain information about expression levels of thousands of genes simultaneously. Their main advantages are: reproducibility and scalability of obtained data, short time of one experiment and, of course, the large number of genes, the expression of which is measured. The technique of producing DNA microarrays is improving continuously. In general, there are two di¤erent types of DNA microarrays: spotted microarrays and oligonucleotide microarrays. There are several important di¤erences between these two types of microarrays. One of them is the technology of the production. While spotted microarrays are obtained by using special spotting robots, oligonucleotide microarrays are synthetized, often using photolitographic technology – the same as used during production of computer chips. There are many ways of exploiting a data from microarrays. One of the most frequently used is the classi...cation of samples belonging to di¤erent classes. Such classi...cation can be applied for example in medical diagnosis and choosing proper medical therapy. One of the ...rst paper dealing with classi...cation was the article by Golub et al. (1999). In this paper samples of two types: acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL) were classi...ed and clusterized. For classi...cation purposes the authors proposed so called weighted voting (WV) algorithm. The AML/ALL data set (available via Internet) was used by other scientists for testing di¤erent methods of analysis. For example, the same data set has been used for testing a more traditional perceptron algorithm in (Fujarewicz et al., 2000, 2001). Obtained results were slightly better than using WV algorithm. In (Furey et al., 2000) relatively new and promising method of classi...cation and regression called support vector machines (Boser et al., 1992; Vapnik, 1995; Christianini et al., 2000) has been applied to the same

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine

We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...

متن کامل

Identification of Alzheimer disease-relevant genes using a novel hybrid method

Identifying genes underlying complex diseases/traits that generally involve multiple etiological mechanisms and contributing genes is difficult. Although microarray technology has enabled researchers to investigate gene expression changes, but identifying pathobiologically relevant genes remains a challenge. To address this challenge, we apply a new method for selecting the disease-relevant gen...

متن کامل

Application of Artificial Neural Networks and Support Vector Machines for carbonate pores size estimation from 3D seismic data

This paper proposes a method for the prediction of pore size values in hydrocarbon reservoirs using 3D seismic data. To this end, an actual carbonate oil field in the south-western part ofIranwas selected. Taking real geological conditions into account, different models of reservoir were constructed for a range of viable pore size values.  Seismic surveying was performed next on these models. F...

متن کامل

O-30: Comparing Expression Patterns of Endometrial Genes in Implantation Failures and Recurrent Miscarriages with Fertile Couples Following ICSI/IVF Using in Silico Analysis

Background: To screen and diagnose patients with recurrent abortions and implantation failure after IVF/ICSI, differentially expressed genes of endometrium through DNA microarrays were monitored. Materials and Methods: Microarray expression profile of GSE26787 dataset from GEO database was used to analyze gene expression profiles of 15 endometrial biopsy samples- five from control fertile (CF) ...

متن کامل

Gene Selection for Cancer Classification Using Bootstrapped Genetic Algorithms and Support Vector Machines

The gene expression data obtained from microarrays have shown useful in cancer classification. DNA microarray data have extremely high dimensionality compared to the small number of available samples. In this paper, we propose a novel system for selecting a set of genes for cancer classification. This system is based on a linear support vector machine and a genetic algorithm. To overcome the pr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003